PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sopim03g098200.0.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family HD-ZIP
Protein Properties Length: 722aa    MW: 79358.8 Da    PI: 6.7006
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sopim03g098200.0.1genomeCSHLView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox581.7e-183085156
                        TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
            Homeobox  1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
                        +++ +++t++q++ Le+ F+++++p++++r +L++ l L  rq+k+WFqNrR+++k
  Sopim03g098200.0.1 30 KKRYHRHTANQIQKLEAIFKECPHPDEKTRLQLSRDLALAPRQIKFWFQNRRTQMK 85
                        678899***********************************************998 PP

2START182.13.1e-572284532206
                         HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....E CS
               START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla....k 78 
                         +a  a++el+++++ +ep+W+ks     + +n d + + f+++++       ++ea+r+sgvv+m+   lve ++d + +W e ++    k
  Sopim03g098200.0.1 228 IAGNAMEELIRLVQTSEPLWMKSTidgrDVLNFDSYDRVFPRANShlknpnVRIEASRDSGVVTMNGLALVEVFMDAN-KWHEFFPtivsK 317
                         67899*******************99999999999999999988899**999**************************.************ PP

                         EEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEE CS
               START  79 aetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghs 162
                         a tlevissg      ++lqlm+ elq+lsplvp R  +f+R+++q ++g+w+ivdvS d +q+++ ss   ++++lpSg+li++++ng+s
  Sopim03g098200.0.1 318 ARTLEVISSGvmgsrtSTLQLMYEELQVLSPLVPtRQLYFLRFCQQIEQGSWAIVDVSYDITQENQFSSTACKVHRLPSGCLIQDMPNGYS 408
                         *****************************************************************98************************ PP

                         EEEEEE-EE--SSXX.HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
               START 163 kvtwvehvdlkgrlp.hwllrslvksglaegaktwvatlqrqcek 206
                         kvtwvehv+++++   h l+r l++sgla+ga +wv tlqr ce+
  Sopim03g098200.0.1 409 KVTWVEHVEVEEKGLiHRLYRDLIHSGLAFGAERWVGTLQRVCER 453
                         **********987555***************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466891.25E-182290IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.604.3E-212491IPR009057Homeodomain-like
PROSITE profilePS5007117.0222787IPR001356Homeobox domain
SMARTSM003898.2E-172891IPR001356Homeobox domain
CDDcd000861.62E-162988No hitNo description
PfamPF000463.9E-163085IPR001356Homeobox domain
PROSITE patternPS0002706285IPR017970Homeobox, conserved site
PROSITE profilePS5084848.764218456IPR002913START domain
SuperFamilySSF559611.36E-35219454No hitNo description
CDDcd088753.34E-119222452No hitNo description
SMARTSM002346.8E-50227453IPR002913START domain
PfamPF018523.5E-49228453IPR002913START domain
SuperFamilySSF559613.06E-21474711No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 722 aa     Download sequence    Send to blast
MEYVGTGGGG GTSSGGGADP HDGSDSHRRK KRYHRHTANQ IQKLEAIFKE CPHPDEKTRL  60
QLSRDLALAP RQIKFWFQNR RTQMKAQHER ADNCALRAEN DKIRCENIAI REALKNVICP  120
SCGGPPVTED SYFDEQKLRI ENMQLKEELD KVSSIAAKYI GRPISQLPPV QPIHLSSLDL  180
SMSSFIGHGP NSLDLDLDLL PGSSSNMPSL AYATLNLSDM DKSLIADIAG NAMEELIRLV  240
QTSEPLWMKS TIDGRDVLNF DSYDRVFPRA NSHLKNPNVR IEASRDSGVV TMNGLALVEV  300
FMDANKWHEF FPTIVSKART LEVISSGVMG SRTSTLQLMY EELQVLSPLV PTRQLYFLRF  360
CQQIEQGSWA IVDVSYDITQ ENQFSSTACK VHRLPSGCLI QDMPNGYSKV TWVEHVEVEE  420
KGLIHRLYRD LIHSGLAFGA ERWVGTLQRV CERHACLMVN SNPSHDHGGA IPSPDGKRSM  480
MKLARRMVSN FCASVNPSNG HQWNTISGLN EFEVRATLQK CTDPGQPNGV VISAAATIWL  540
PIPPQQVFNF FRDERTRPQW DVLSNQNPVQ EVAHIANGSH SGNCISVLRA YNTSQNNMLI  600
LQESCIDSSG ALVVYSPVDI PAINIAMSGE DPTYIPLLPS GFTISPDGRQ LDTDAASCSS  660
SSNASTIGGG RSGGSLITVV FQILVSSLPS AKMSPESVNT VNNLIGSTVH QIKSALNCST  720
S*
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC2384700.0AC238470.1 Solanum lycopersicum chromosome 3 clone C03HBa0298P15, complete sequence.
GenBankHG9755150.0HG975515.1 Solanum lycopersicum chromosome ch03, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_004235382.10.0PREDICTED: homeobox-leucine zipper protein HDG11
SwissprotQ9FX310.0HDG11_ARATH; Homeobox-leucine zipper protein HDG11
TrEMBLK4BJN50.0K4BJN5_SOLLC; Uncharacterized protein
STRINGSolyc03g098200.2.10.0(Solanum lycopersicum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA24902351
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G73360.10.0homeodomain GLABROUS 11